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Abstract 



O; 

Q^) | This article establishes sufficient conditions for a linear-in-time bound on the non-asymptotic 

variance for particle approximations of time- homogeneous Feynman-Kac formulae. These formulae 

appear in a wide variety of applications including option pricing in finance and risk sensitive 

control in engineering. In direct Monte Carlo approximation of these formulae, the non- asymptotic 

variance typically increases at an exponential rate in the time parameter. It is shown that a linear 

bound holds when a non-negative kernel, defined by the logarithmic potential function and Markov 

00 . kernel which specify the Feynman-Kac model, satisfies a type of multiplicative drift condition and 

OO ■ 

other regularity assumptions. Examples illustrate that these conditions are general and flexible 
£C) ' enough to accommodate two rather extreme cases, which can occur in the context of a non-compact 

<3Q ■ state space: 1) when the potential function is bounded above, not bounded below and the Markov 

kernel is not ergodic; and 2) when the potential function is not bounded above, but the Markov 
kernel itself satisfies a multiplicative drift condition. 

Keywords: Feynman-Kac Formulae; Non- Asymptotic Variance; Multiplicative Drift Condition. 

X 

H . 1 Introduction 

& . 

On a state space X endowed with a cr-algebra B (X) let M be a Markov kernel and let U : X — > M 
be a logarithmic potential function. Then for i£X, consider the sequence of measures {jn^] n > 1} 
defined by 



7n,z (<P) '■= E J 



fn-X 



exp £ U (X k ) V (A„) 



\k=Q 



(1.1) 



for a suitable test function <p and where E x denotes expectation with respect to the law of a Markov 
chain {A„; n > 0} with transition kernel M, initialised from Xq = x. 

Feynman-Kac formulae as in (|1.1|) arise in a variety of application domains. In the case that U is 
non-positive, the quantity 7,^ (1) can be interpreted as the probability of survival up to time step n of a 



1 



Markovian particle exploring an absorbing medium [Del Moral and Miclol . 120 03. Del Moral and Doucetl . 
2004|; the particle evolves according to M and at time step k it is killed with probability 1 — 
exp (U (Xk))- Another application is the calculation of expectations at a terminal time with respe ct to 



Jasra and Doucet 



2009]). In 



jump-diffusion processes which may or may not be partially observed (e.g. 
particular, for option pricing in finance, there are a variety of options, (e.g. asian, barrier) which can be 
written in the form (jl.lll where the potential function arises from the pay-off function/change of mea- 
sure and the Markov kernel specifies finite dimensional marginals of some partially observed Levy pro- 



cess (e.g. lJasra and Del Morall 201l| ). It is remarked that in this latter example, the finite dimensional 
marginals can induce a time-homogeneous Markov chain that is not necessarily ergodic. Furthermore, 
functionals as in (jl.ljl arise in certain stochastic control problems, where one considers the bivariate 
process {X n = (Y n , A n );n > 0} with Y n being a controlled Markov chain and {A n ;n > 0} a control 
input process. In some cases the transition kernel M can be expressed as Mi(y n , da n )M2(y n , o, n , dy n +i) 
with Mi corresponding to the control law or policy and Mi to the controlled process dynamics. In a 
risk-sensitive optimal control framework ■^\og'y n , x (1) arise s as a cost function one aim s to m inimise 



with respect to an appropriate class of policies; see (Whittle! . ll99dllDi Masi and Stettnerl . ll999j for de- 
tails. In such problems it is common to choose U(y, a) to be unbound ed from above, e.g. U is usually 



chosen to be a quadratic for linear and Gaussian state space models Whittle, 1990]. Mor e generally 



(11.11) arises as a special case of a time-inhomogeneous Feynman-Kac formulae studied by 



2004]. 



Del Moral 



The non-negative kernel Q(x,dy) := exp (U(x)) M{x, dy), defines a linear operator on functions 
Q ((p) (x) := J Q (x, dy) ip (y) and (ll.l[) can be rewritten as j njX (ip) = Q n (<p) (x), where Q n denotes 
the n-fold iterate of Q. In the applications described above, the Feynman-Kac formulae (jl.ip typically 
cannot b e evaluated analy tically. However, they may be approximated using a system of interacting 
particles 



Del Moral 



2004] . These particle systems, also known as sequential Monte Carlo methods in 



the computational statistics literatu re (e.g. 
intensive study, see amongst others 



Chopin et al 



2011 



Del Moral et al 



Doucet et 



Crisan and Bain 



d 



2008, 



2001 1), have themselves become an ob j ect of 



Del Moral et al 



2009 



van Handell . 



2009 



2011] and references therein for recent developments in a variety 



of settings. 

The present work is concerned with second moment properties of errors associated with the particle 
approximations of {^ n ,x\- In order to obtain bounds on the relative variance, we control certain tensor- 



product functionals of these particle approximations, recently addressed by ICerou et al.l [201 1| . using 
stability properties of the operators {Q n ;n > 1}. These stability properties are themselves derived 
from t he multiplicative ergodic and spectr al theories of linear operators on weighted oo-norm spaces 
due to Kontoviannis and Mevn 2003 , EoO^ ; this is one of the main novelties of the paper. By doing so 



we obtain a linear-in-n relative variance bound under assumptions on Q which are weaker than those 
relied upon in the literature to date and which readily hold on non-compact spaces. Furthermore, to 
the knowledge of the authors, these are the first results which establish 

• that a linear-in-n bound holds under conditions which can accommodate Q defined in terms of 
a non-ergodic Markov kernel M, 

• that any form of non-asymptotic stability result for particle approximations of Feynman Kac 
formulae holds under conditions which can accommodate U not bounded above. 



2 



1.1 Interacting Particle Systems 

Let N £ N be a population size parameter. For n £ N, let := jcL^'^; 1 < i < ivj be the ra-th 
generation of the particle system, where each particle, , is a random variable valued in X. Denote 



1 



77^ := — J2iLi ^m<) • The generations of the particle system ^Ci^ ; > j form a X^-valued Markov 
chain: for x £ X, the law of this chain is denoted by and has transitions given in integral form by: 



3 ,iV 



K (c 

(( ( n N) edy 



N 



r e dy 



i=l 
N 

n 



n > 1, 



(1.2) 



where dy = d (y 1 , . . . y w ) , 1 is the unit function and for some test function 77^ (<^?) := — V ^Cn^'*^ 

(here the dependence of rj^ on a; is suppressed from the notation). These transition probabilities cor- 
respond to a simple selection-mutation operation: at each time step N particles are selected with 
replacement from the population, on the basis of "fitness" defined in terms of e u , followed by each 
particle mutating in a conditionally-independent manner according to M. 
The empirical measures {7^ x ; n > 0}, defined by 

n-l 
k=0 



Del Moral 



and := 5 X , are taken as approximations of {j n ,x}- It is well known 
that 

where denotes expectation with respect to the law of the TV-particle system. 



2004 Chapter 9] 



1.2 Standard Regularity Assumptions for Stability 



Rece nt work on analysis of tensor product functionals associated with {7,^.; n > 0}, [Del Moral et al 



200{J, has lead to important results regarding higher moments o f the error associa ted with these 

have proved a 



Cerou et al 



|2011 | 



particle approximations; in a possibly time-inhomogeneous context 
remarkable linear-in-n bound on the relat ive variance of 7^ (1). In the context of time- homogeneous 



Feynman-Kac models, the assumptions of 



Cerou et al 



2011 



are that 



sup U(x) < 00 
and that for some mo > 1, there exists a finite constant c such that 

Q mo (x, dy) < cQ mo (x', dy) , V(i,i')eX 2 . 



(1.3) 



(1.4) 
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The result of 



Cerou et al 



20111 is then of the form: 



N > c(n + 1) 



E 



/v 



7n,x(l) 



- 1 



<c-(n + l). 



\fx e X. 



(1.5) 



where c is as in (|1.4j) . The efficiency of the particle approximation is therefore quite remarkable: a 
natural alternative scheme for estimation of 7 n , x (l) is to simulate N independent copies of the Markov 
chain with transition M and approximate the expectation in (jl.ll) by simple averaging, but the relative 
variance in that case typically explodes exponentially in n. The restriction is that (jl.4p rarely holds 
on non-compact spaces. The present work is concerned with proving a result of the same form as 
(|1.5|l under assumptions which are more readily verifiable when X is non-compact. The main result 
is summarized after the following discussion of (|1.3[) - (|1.4[) and how they relate to the assumptions we 
consider. 

The condition of (|1.4|) and its variants are very common in the lit erature on exponential stability 
of nonlinear filters and t heir p article approximations, see for example 



Del Moral and Guionnet 



2001 



Le Gland and Oudiand . l2004j and references therein. It can be interpreted as implying a uniform 



bound on the relative oscillations of the total mass of Q mo , i.e., 

Qm (1) (g) v(t i'kX 2 



(1.6) 



and this is very useful when controlling va rious functionals which arise when analysing the relative 
variance as in (|1.5p . (see 



Cerou et al 



2011 



Proof of Theorem 5.1). However one may take the inter- 
pretation of (|1.4p in another direction: it implies immediately that there exist finite measures, say /? 
and v, and e > such that 



(x,dy)<P(dy) 



im 



(x, dy) > eu (dy) , Va; G X. 



(1.7) 



In the case that U = (i.e Q = M is a probabilistic kernel) and M is ■(/'-irreducible and aperiodic, 
this type of minorization over the entire state space X implies uniform ergodicity of Q, which is 
in turn equivalent to Q s atisfying a Foster-Lyapunov drift condition with a bounded drift function 



Meyn and TweedieL I2009L Theorem 16.2.2]. In the scenario of present interest, where in general 
U 7^ 0, one may take V : X — > [1, oo) to be defined by V(x) = 1, for all x, and then when ()1.3|) holds, 
it is trivially true that there exists 5 € (0, 1) and b < oo such that Q satisfies the multiplicative drift 
condition, 



Q(e V ) 



< 



(1.8) 



where Ix is the indicator function on X. Q may then also be viewed as a bounded linear operator on the 
space of real-valued and bounded functions on X endowed with the oo-norm, which is norm-equivalent 



[in the sense of iMevn and TweedieL 120091 p. 393] to 
weighting function. 



I VII 



su Pxex ' 



with V any bounded 



exp V(x) 

As explained in the next section, the interest in writing (|1.7p - (|1.8p is that conditions expressed 



4 



in this manner have natural generalisations in the context of weighted oo-norm function spaces with 
possibly unbounded V. 



1.3 Setting and Main Result 



Del Mora 



2004 



(e.g. Chapter 4 and Section 12.4)] and 



Del Moral and Doucet 120041 address the 



setting in which {Q n ] n > 1} is considered as a semigroup of bounded linear ope rators on the Banach 
space of real- valued and bounded functions on X, endowed with the oo-norm, and 



Del Moral and Miclo 



2003| address the L2 setting, connecting stability properties of the measures {^ ntX } and their normal- 



ize d counterparts to the spectral theory of bounded linear operators on Banach spaces. 



Kontoviannis and Mevn 



2003 



20051 ] have developed multiplicative ergodic and spectral theories of 
operators of the form Q in the setting of weighted oo-norm spaces; a function space setting which has al 



ready proved to be very fruitful for the study of general state-space Markov c hains Mevn and T weedie. 



2003, 



2009, Chap ter 16] without reversibility assumptions. The reader is referred to [Kontoviannis and Mevn 



2005| for extensive historical perspective on this spectral theory and related topics , including 



(of pa rticular relevance in the pr esent context) the theory of non-nega t ive op e rators due to 



20041 . Chapter 5]. The work of Kontoviannis and Mevnl . 



2003 



2005 



Nummelin 



Mevnl . l2006j is geared towards 



large deviation theory for sample path ergodic averages n _1 X)fc=o U(Xk) under the transition M and 
in that context it is natural to state assumptions on M and U separately. By contrast, when studying 
the particle systems described above, we are not directly concerned with such sample paths, but rather 
the relationship between the proper ties of the particle approximations } and their exact counter- 



2003 



2005^ will be applied to this effect, 



parts {"f n ,x}- Some of the results of iKontoviannis and Mevnl 
but starting from assumptions expressed directly in terms of Q which reflect the scenario of interest. 

The core assumptions in the present work (see Section |2~21 for precise statements) are that for some 
constants uiq > 1, S € (0, 1) and all d > 1 large enough, 



lm (x, dy) 
Q(e V ) 



> tdVd (dy) , Vie e C d , 

< e V(l-S)+b d lo d 



(1.9) 
(1.10) 



with V unbounded and Cd := {x :V(x) < d} C X a sublevel set. It is noted that one recovers the 
minorization and drift of (|1.7[) - (11.8p in the case that V is bounded and Cd = X. We will also invoke 
a density assumption which is weaker than the upper bound in (ll.7p . It will be illustrated through 
examples in Section [4] that ()1 .9|) - (|1 . 10[) can be satisfied in circumstances which allow M to be non- 
ergodic. Furthermore, it will also be demonstrated that, in contrast to (|1.3p . conditions (|1.9|) - (|1.10|) can 
be satisfied with U not bounded above, subject to strong enough assumptions on M and a restriction 
on the growth rate of the positive part of U . 

The main result obtained in the present work (Theorem 13.21 in Section |3|) is a bound of the form: 



N > a (n+ 1) > <f>(x) 



E 



7^(1) 

7ra,x(l) 



- 1 



< c 2 — (n + 1) ■ 
~ N 



,2+r 



(x) 



K{x) 



5 



with 



<j>(x) 



Cl 



v(x) 



h (x) 



1 



where v(x) = e v ^ x \ Bo, B\, c%, c-i are constants which are independent of N, n and x and for a real 
number a we denote as \a] the smallest integer j such that j > a. In this display ho is the eigenfunction 
associated with the principal eigenvalue of Q and the constant B\ is directly related to the size of the 
spectral gap of Q. Verification of the existence of ho along with various other spectral quantities plays 
a central role in t he proofs. 



We note that 



Del Moral and Doucet 



2004], 



Cerou et al 



201 1| also consider the case in which 



expU(x) may touch zero and the former are also directly concerned with approximation of the eigen- 
value A corresponding to ho via the empirical probability measures {77^}. These issues are beyond 
the scope of the present article but the study of these and re lated issues in a m ore general time- 
inhomogeneous setting is underway. It is also remarked that ICerou et alj 201 lj consider a more 



general type of particle system, which involves an accept/reject evolution mechanism. The approach 
taken here is also applicable in that context, but for simplicity of presentation we only consider the 
selection- mutation transition in (jl.2l) . 

The remainder of the paper is structured as follows. Section [5] is largely expository: it introduces 
various spectral definitions and the main assumptions of the present work and g oes on to show how these 
assum ptions validate the application of multiplicative ergodicity results of iKontoviannis and Mevn 
2005) . It is stressed that much of the content of this section is included in order to make clear 



the simila rities and differences between the setting of interest and the main stated assumptions and 
results of 



Kontoviannis and Mevn 



20051 ] . Section [3] deals with the variance bounds for the particle 



approximations. Numerical examples are given in Section 2J Many of the proofs of the results in 
Section [5] are in Appendix |XJ Some proofs and lemmas for the results in Section [3] can be found in 
Appendix [Bj 



2 Multiplicative Ergodicity 
2.1 Notations and Conventions 

Let X be a state space and B(X) be an associated countably generated tr-algebra. We are typically inter- 
ested in the case X = K d -, d x > 1, but our results are readily applicable in the context of more general 
non-compact state-spaces. For a weighting function v : X — > [1, 00), and ip a measurable real-valued 
function on X, define the norm H^H^ := sup^gx |^(a;)| /v(x) and let L v :— {ip : X — > R; H^H^ < 00} be 
the corresponding Banach space. Throughout, when dealing with weighting functions we employ an 
lower/upper-case convention for exponentiation and write interchangeably v = e v . 

For K a kernel onXxB (X), a function ip and a measure /i denote '■— J ip(x)[i(dx), Kip(x) :— 
J K(x,dy)ip(y) and fJ-K(-) := J [j,(dx)K(x, •). Let V be the collection of probability measures on 
(X, B(X)), and for a given weighting function v : X — [1, 00) let V v denote the subset of such measures 



G 



H such that fx(v) < oo. For n > the n-fold iterate of K is denoted: 

K := Id, K n := K_^_K, n > 1. 

n times 

The induced operator norm of a linear operator K acting C v — > C v is 



\K\\\ V := sup 



t ^ € C v , \\<p\\ v ? o j = su P {||^L ; ^ e £„, M < «} . 



The spectrum of K as an operator on C Vl denoted by S V (K), is the set of complex z such that 
[Iz — K] 1 does not exist as a bounded linear operator on C v . The corresponding spectral radius of 
K, denoted by £ V (K), is given by 

UK) :=sup{\z\;zeS v (K)}= lim |||tf n ||li /n , 

n— J-oo 

where th e limit always exists by subadd itive arguments, but may be infinite. The following definitions 
are from 



Kontoviannis and Mevn 



2005|. 



A pole zq € S V (K) is of finite multiplicity n if 

— for some t\ > we have {z € S V (K); \z — zq\ < e{\ — {zq}, 

— and the associated projection operator 

J-^-J [Iz-K^dz 



2ni 



d{z:\z-z \<€ 1 } 



can be expressed as a finite linear combination of some {s^ C C v and {z/j} C V v , 

n-l 

where [s^ (g) i/y] (a;, <i?/) = Si{x)vj{dy). 

• if admits a spectral gap in if there exists eo > such that S v (if) n {z : |z| > (if) — cq} is 
finite and contains only poles of finite multiplicity. 

• if is v-uniform if it admits a spectral gap and there exists a unique pole A <E S v (if) of multiplicity 
1, satisfying |A| = (X). 

• if has a discrete spectrum if for any compact set i? C C \ {0}, £„ (if) n B is finite and contains 
only poles of finite multiplicity. 

• if is v-separable if for any e > there exists a finite rank operator if ^ such that \\\K—K^ \\\ v < e 
2.2 Multiplicative Ergodic Theorem 



In thi s section we present the main assumptio ns and state some results from iKontoviannis and Mevn 



2005j (see also IKontoviannis and Mevnl [2003]). 
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2.2.1 Assumptions 



(HI) The semigroup {Q n ;n > 1} is -0-irreducible and aperiodic (see iMevnl 20061 Section 2.1]). 



(H2) There exists an unbounded V : X —> [l,oo), constants mo > 1, S G (0,1) and d > 1 with the 
following properties: 

For each d > d and Cd '■= {x e X; V(x) < d}, 

• there exists € (0, 1] and Vd € V v such that Cd is (mo, £d, ^d)-small for Q, i.e., 



Qmo(^>") > ICrf^ed^dC-): Vx € X, 



(2.1) 



with Vd (Cd) > 0. Furthermore Q m „ (Cd) (x) > for all ieX. 
• there exists bd < oo such that the following multiplicative drift condition holds, 



Q(e v ) < e ni-*)+Mc d . 



(2.2) 



(H3) U : X -> R is such that 



[/+ := max (C7, 0) € £y- 



(H4) There exists to > 1 and for each d> d there exists a measure such that ^ (e v ) < oo and 

(X ta e A, r C5 > to) < /?d (^) , x € C d , A G B(X), 
where denotes the law of the Markov chain {X n } with transition M and ta ■— inf {n > 1 : X n G ^4}. 

Remark 2.1. We take care to emphasize the following differe nces and similarities between the above 
assumptions and the setting of iKontoviannis and Mevn 



Assum ption (1© equation (|2.2p applies directly to the Q kernel, whereas IKontoviannis and Mevn 
|2005l | impose a multiplicative drift condition on M. The key issue is th at the multiplicative drift 
condi tion for Q is the essential and implicit ingredient of Lemma B.4 of IKontoviannis and Mevn 



2005] , and as we shall see in Section @J under the conditions that U is bounded above but 
not bounded below, assumption (F (2^ can hold w ithout geometric drift assumptions on M. A 



related phenomenon is considered by IMevnl [2006] in order to obtain "one-sided" large deviation 
principles for ergodic sample-path averages for the chain with transition M. 

Assumption (EGJ requires the sublevel sets of V to be small for Q and this is exploited in Lemma 
IA.ll The explicit mo-step minorisation con dition makes it easy to bound below the spectral 
radius of Q, see Lemma l2.ll In the setting of Kontoviannis and Mevn 2003^ the spectral radius 
of Q is bounded below by 1 as U is assumed centered with respect to the invariant probability 
distribution for M . In the present context, this centering assumption is unnatural, especially as 
we want to consider some situations where such an invariant probability does not exist. 
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Assu mption (HHJ) is weaker than the corresponding assumption in the st atement of [ Kontoviannis and Mevn 



2005 . Theorem 3.1]. However, (I^3j) coincides with the first part of 



Kontoviannis and Mevn 



20051 Equation 73], which combined with (HI]), and (L0J in Lemma [2.21 below, is enough 

to prove that Q has a discrete spectrum in C v . 



Kontoviannis and Mevnl . 120051 Theorem 3.4] and jKontoviannis and Mevn 



200. 



31. 



As shown in 

a MET can be proved without (HI]), but at the cost of restrictions on the class of functions to 
which U belongs which are a little unwieldy. 



2.2.2 Results 

We now give a collection of results which are used to prove the MET, Theorem 12.21 The proofs are 
given in Appendix [A] It is remarked that the steps in the proof of Theorem 12.21 are effectively the 
same as part of the proof of Theorem 3.1 of iKontoviannis and Mevnl 2005J, however, our starting 
assumptions are stated differently. 

The following preparatory lemma establishes that the Feynman-Kac formula (|l.lj) is well defined 
and presents bounds on the spectral radius of Q. 



Lemma 2.1. At 



and for all d > d, 



where d is as in 



Then for all x S X, n > 1, ip G C v , 

hn,x 0)1 < °o, 



tdVd {C d ) < (Q) < oo, 



(2.3) 



(2.4) 



To clarify how assumptions (E(T])-(]rG| connect with the results of IKontoviannis and Mevnl 2005 1 
we next present a lemma regarding the w-separability of Q which is a stepping stone to the MET. 
Observe that the multiplicative drift condition (H2]) implies that Q can be approximated in norm to 
arbitrary precision by truncation to the sublevel sets of V, in the sense that for any r > d, 



I C cQ(e y ) < e 



V-Sr 



(2.5) 



and then with Q ( r ) := Ig^Q, it follows immediat ely that |||Q — Q^|||„ < e Sr . In the following lemma, 



which combines Kontoviannis and Mevnl . 120051 Lemmata B.3-B.5] and is included here for complete- 
ness, the density assumption (H3]) pl & y s a key role in establishing that iterates of this truncation of Q 
can be approximated by a finite rank kernel. 

Lemma 2.2. Assume (Iflty-(I^). Then Q2t +2 is v-separable, where to is as in (I^4ty. 

The following theorem makes a key connection between w-separability and a discrete spectrum. 



Theorem 2.1. JKontoviannis and Mevn , \200a . Theorem 3.5} If the linear operator Q : C v — » C v is 
bounded and Q ta is v-separable for some to > 1, then Q has a discrete spectrum in C v . 
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Under (Ej2j) Q is indeed bounded, so has a discrete spectrum in C v and then by definition it also 
admits a spectral gap in C v . For any > £ v {Q) we may consider the resolvent operator defined by 



R e := [16 - QY 1 =5^1 



!-fc-l/ 



(2.6) 



fc=0 



We can now state and prove the MET: 



Theorem 2.2. Assume (HJty-(H^). Then X = £„ (Q) is a maximal and isolated eigenvalue for Q. For 
any d> d and 9 > (, V (Q), the operator Hg d defined by 



IX g - (Re - 6(- m ^e d l Cd ®v d )] = ]T A^ 1 ( 



Re-' 



(— mo — 1 



k=0 



] e d l Cd ®v d ^ , (2.7) 



is bounded as an operator on C v> with Xg := (9 — £ v {Q)) ■ 
The function ho £ C v and measure fio € V v defined by 



ho := 



Mo-SmCW 
are independent of 9, d and satisfy 

Qh Q = Xh , 



Mo := 



v d He, d 
VdH e ,d (1) ' 



(2.8) 



Mo {ho) = 1. 



Furthermore, there exist constants Bo < oo and B\ > smc/i i/iai /or any ip £ £ v , any n > 1 and 
any x £ X, 



|A ™7n,x(<p) - /K)(x)^ (<p)| < IML-Bo 



,-nfli 



i;(x). 



(2.9) 



Proof . We give only a sketch proof, as it is essentially that of Theorem 3.1 of iKontoviannis and Mevn 
|2005| . As established in Lemma |2.1[ under our assumptions < {Q) < 00. Furthermore the 
semigroup associated with Q is ^-irreducible, and as observed above Q i s bounded on C v , has a d i screte 



Kontoviannis and Mevn 



spectrum and therefore admits a spectral gap in C v . Proposition 2.8 of 
therefore applies. Thus Q is w-uniform and A = £ V {Q) is a maximal and isolated eigenvalue. 

By the minorization condition of (E(2]) one can obtain a minorization condition for Rg of (|2.6 



2005 1 



Rg{x,dy) > $<■ 



-m -l) 



tdlc d { x ) v d{dy), 



which holds for any d > d,9 > £ V {Q). Therefore by the argument in IKontoviannis and Mevn 
2005| [Proof of Proposition 2.8], for any 9 > £ V {Q) and d > d, the spectral radiue of [Re-9^- m °- l h d lc d ® 



v d \ is strictly less than Xg = {9 — £ v {Q))~ . Thus Hg. d is bounded as an operator on C v and the sum 
in (|2.7|) converges in the operator norm. 

Then also by Kontoviannis and Mevn . 2005$ [Proposition 2. 81, He. d {Ig^) € £„ is an eige nfunc- 
tion for Q with eigenvalue A = £ v {Q). By similar arguments to IKontoviannis and Mevnl 2003] [proof 
of Proposition 4.5] it is easily verfied that v d Hg^ d is an eigenmeasure. The normalization to ho and /xg is 
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justif ied by the finiteness, under our assumptions, of the associated quantities. By Kontoviannis and Mevn 
l20od | [Theorem 3.3 part (iii), see also comments on p. 332] ho and /iq constructed using any 9, d are 
respectivaly the ^-essentially unique eigenfunction and unique eigenmeasure satisfying /^o(X) = 1, 
Mo(^o) = 1: hence the lack of dependence on 9, d. 
To obtain (|2.9p one may define the twisted kernel: 



P(x,dy) := A 1 h 1 (x)Q(x,dy)h (y), 



(2.10) 



which can be seen to be well defined as a Markov kernel, as A is strictly positive and finite and (EEJ 
implies ho is everywhere finite and strictly positive. Furthermore one observes immediately that P 
admits fr, defined by ^{^f) = /i {hoip) //x (ho) = [i (hpLp ), as an invariant probab i lity d istribution. By 
Lemma lA.ll in Appendix lAlone can apply Theorem 3.4 of lKontoviannis and Mevn 2005] t o the Markov 
chain associated to the twisted kernel 
ip/ho, F 



Kontoviannis and Mevn 



20051 ] . take g 



in the notation of of Theorem 3.4 of 
0). This results in the bound (]2.9I) . which completes the proof. 



□ 



Remark 2.2. Upon dividing through by ho, the equation (|2.9p of the MET may be viewed as a proba- 
bilistic, geometric ergodic theorem for the twisted chain associated to the kernel (]2.10]) and the modified 
test function <p/ho, with a naturally modified drift function v — e v proportional to v/ho. See Lemma 
IA. II in Appendix lAl 

Remark 2.3. The constant B\ in e quation (|2.9p is directly related to the size of the spectral gap of Q, 



see 



Kontoviannis and Mevn 



20031 Proof of Theorem 4.1]. 



3 Non- Asymptotic Variance 
3.1 Tensor Product Functionals 

The various tensor product functionals considered in the remainder of this paper require some addi- 
tional notation. For a measurable function F on X 2 and a weighting function v : X — > [1, oo), we define 
the norm \\F\\ V 2 := sup^ yeX2 \F(x, y)\ / (v(x)v(y)) and denote £„ i2 := j-F 1 : X 2 — > R; 2 < oo| the 
corresponding function space. For two functions ipi,(f2 G C Vl we denote by tp\ ® if2 € C v ,2 the tensor 
product function defined by ipi (g) ifi2(x, x') := ipi(x)<p2(x'). Let K : X x B(X) — > R + be a kernel on X. 
The two-fold tensor product operator corresponding to K is defined, for any F G C Vt 2, by 

K® 2 (F)(x,x') := / K(x,dy)K(x',dy')F(y,y'). 

The iterated operator notation of the previous section is carried over so that 

Kf 2 := Id, K® 2 K® 2 . . . K® 2 , n>l. 

n times 

Corresponding to the particle empirical measures of section [TTT1 for n > 1, we introduce the tensor 
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product empirical measures (or 2-fold V— statistic): 



Following the definition of 
tions on X 2 , is defined by 



l<i,J<JV 



AT\«>2 



Cerouetal 



201l| . the coalescent integral operator D, acting on func- 



D(F)(x,x') =F(x,x), (x,x')eX 2 



For any < s < (n + 1), we denote by T n ,s '■= {(ii, ■■•>ia) € Nq;0 < i\ < . . . < i s < n} the set of 
coalescent time configurations over a horizon of length n + 1 and for ...,i s ) G I„. s and igX, the 
nonegative measure r^a" on (X 2 ,£?(X 2 )), and its normalised counterpart Tn*x "' ts \ are defined by 



p(*i 



lf^DQf_ i D..,Qf- 



, D Qt-i s ' 



p(*l,...i„) 



r (*i.- 



7n,,(l) 2 ' 



(3.1) 



for s > 1, and for s = 0, (F) := 7 ® 2 (F) and r T ( ^ (F) := ry® 2 (F). We refer the reader to 



Cerou et al 



2011 



Section 3] for a helpful visual representation of the integrals in the transport 
equation (|3.ip . We have already checked in Lemma 12.11 that the Feynman-Kac formula (jl.ip is well 
defined under our assumptions in the C v setting, which validates the denominator of (|3.1[) . 

When Theorem 12.21 holds . we will denote by E^ expectation with respect to the law of the twisted 
Markov chain {X n ; n> 0}, i.e that with transition kernel P as in equation (|2 . 1 0|) and initialised from 
X = x. 



3.2 Non- Asymptotic Variance 

In this section we give our main result. The proof is detailed in section T3.3I The following additional 
assumption imposes some further restrictions on the function class considered, but this is not overly 
demanding, considering that we will be dealing with coalesced tensor product quantities. 

(H5) Let V and d be as in assumption (EJ2]). There exists < eo < e and for all d > d, there exists 
b* d < oo such that 

Q L{l+e)v\ < e (l+e)V-(l+e )V+b* d l Cd _ 



The following theorem is due to 



Cerou et al 



2011 1 . 



Theorem 3.1. 

expansion holds: 



Cerou et all \201j\ . Proposition 3.4] For any n > 1, x € X and N > 1 the following 



E 



N 



7^(1) 

7n,x(l) 



- 1 



n+1 

EC 

8=1 



^ \ (n+l)-« j 



N 



E 



=\(i 1 ,...i„) 



(1® 1) - 1 



(3.2) 



(ii,—,i,)£l n , s 

where denotes expectation w.r.t. the law of the N -particle system. 
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A full proof is not provided here. However, we note that we may write 



E 



N 



E 



JV 



(7^) 82 (1®1) 



7«,x(l) 2 



(3.3) 



where the equality is due to the lack of bias property E^ [7^(1)] = 7n,x(l) and the definition of 
{.In x)^ 2 ■ I n summary, the proof of Theorem 13. II involves recursive calculation of the expectation on 
the right of (13.31) , follow ed by organisation of the resulting terms into the form (|3.2p . The reader is 



directed to ICerou et al 



201 1| for the details. 



It is remarked that there is a different error decomposition in [Chan and Lai l201l| , which can hold 
to any order under appropriate regularity conditions; one would conjecture that this decomposition 
can also be treated, but this is not considered here. The main result of this section is the following 
theorem, whose proof is postponed. 

Theorem 3.2. Assume (tfl}j)-(H^). Then there exists c\ < 00 and C2 < 00 depending only on the 
quantities in (IJTty-(IJ$) such that for all i£X, 



N > a (n + 1) > <f>(x) 



E 



7^(1) 

7n,x(l) 



- 1 



< c 2 — (n + 1) ■ 
~ N 



,2 + £ 



(x) 



K{x) 



with 



<t>(x) := c x 

and where Bq and B\ are as in Theorem 



Si 



log 



B; 



v(x) 



h (x) 



3.3 Construction of the Proof 

In the following Section, we detail the argument to prove Theorem I3.2I To that end, we present 
the essence of the argument with Proposition I3.ll and Lemma I3.ll below; the proofs of which are in 
Appendix [Bl along with some supporting results. 

The proof of Theorem 13.21 is constructed in the following manner. B y Theorem | 3.1 | we h ave the 



Cerou et al 



201 lj focuses 



decomposition (13.21) in terms of the operators The proof in | 

upon controlling these expressions via the regularity conditions mentioned in section 11.21 our proof 
will do the same, except under (L{T])-(tG]). 

Throughout the remainder of this paper, let V* : X — > [1, 00) is defined by 

V*(x) := K(a;)(l + e)-Iog/io(a;) + bg||holU+.). ( 3 - 4 ) 
where e is as in (E(5]). We proceed with the following key proposition. 
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Proposition 3.1. Assume (BJty-(BJ^). Then there exists c < oo depending only on the quantities in 
(BJ^-(I{^) such that for all n > 1, < s < n + 1, (i±, ...i s ) £ l n . s , F € C v i/2 2 <md ieX, 



r&~*->(f) < ||F|| ul/2 , 2 c s + 



i "0*0 



E, 



/i (a;) 



E x [1/ho (*„)]' 



(3.5) 



iwt/i the conventions that the product in the numerator is unity when s < 1, and in the case of s = 0, 
i s = 0. In the above display, v is as in (HE), ho £ C v is the eigenfunction as in Theorem \ 2.2\ and 
v* = e v is as in (|3.4p . 

This result of Proposition l3~Tl connects the operators |f i* 1 ^'" 43 ' j with expectations of the Lyapunov 
functions v and v* and the eigenfunction, w.r.t. the twisted chain. Given this result, one needs to 
control the numerator and denominator. The latter can be achieved by the MET of Theorem 12.21 and 
the former via the following: 

Lemma 3.1. Assume (B^j)-(B^). Then there exists c < oo depending only on the quantities in (H^)- 
(B^j such that for any n> 1, 1 < s < n + 1, (ii, i s ) € T- n ,si 



E, 



n v (x k ) v * (x n+1 ) 

fe£{<i,...,i s } 



< c s+L v*(x), VieX, 



(3.6) 



where v* is as in (|3.4p . 

We now proceed with the proof of Theorem 

Proof. [Proof of Theorem 13. 2| By Proposition 13.11 and Lemma 13.11 we have that there exists a finite 
constant c depending only on the quantities in (t(T|)-(]r(5|) such that 



M*) ' t x [i/h Q {x n )\ 



(3.7) 



Using the fact that E a [l/h (X n )] = J n ,x(^)/[^ n ho(x)] we appeal to (|23)) of the MET of Theorem 
21 as follows. Without loss of generality, it can be assumed that Bq > 1. Then for all x E X 



n > 



— log 



R 2 «(*) 



h (x) 



=> 1 - B e~ 



3m v ( x ) > B - 1 



ho(x) 



Bn 



l x [l/h (X n )] > ^-i. (3.8) 



Throughout the remainder of the proof the left-most inequality in (|3.8p is assumed to hold. Then 
combining Q3.8P with (|3.7p and recalling the definition of v* we have that there exists Co < oo such 
that 



f(ti,...t„) /I ^ i\ < A , ir s+i t;2+£ ( a: ) 



Proceeding by the essentially the same argument as in 



Cerou et al 



2011 



Proof of Theorem 5.1], we 
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use the identity: 



n+l 



E E 



n 



s=l (ti,...,*„)eZ n ,„ie{«i 1 ... 1 i»} 



n( i+ ^) 



s=0 



which holds for any n > 1 and {a s ; s > 0}, to establish via Theorem 13.11 that 



E 



JV 



7^(1) 

7n,x(l) 



< C C 



n+l , 



v 2+e (x) / 1 



(n+l) -a 
N ) N s 

n+l 



1 



N — 1 



E ^ 

(i lr .,! a )£l„.« 

n+l 

- 1 



V 2+e (a;) 



1 



JV- 1 



Cerou et al 



Then exactly as in 

N > 1 + c(n+ 1) 
This completes the proof. 



2011 



K{x) 

Proof of Corollary 5.2], 

n+l 



n+l 



- 1 



1 



N - 1 



-1< J^J c ( n+1 ) ^ ^ c ( n + 1 )- 



□ 



4 Examples 

This section gives some discussion and examples of circumstances in which the assumptions can be 
satisfied. In particular we focus on the drift assumption of (fE|). It seems natural to consider two 
general cases: those in which it is not assumed, or it is assumed, that the Markov kernel M itself 
satisfies a multiplicative drift condition. 



4.1 Cases without a multiplicative drift assumption on M 

In this situation, the decay of the potential function plays a key role in establishing the multiplicative 
drift condition, illustrated as follows. 

Lemma 4.1. Assume that there exists V : X — > [l,oo) unbounded such that \\\M\\\ V < oo and for 
all d > 1, C'd is (1, e<2, Udj-small for M, with Vd(Cd) > and M(Cd){x) > for all x. If for all 
d>l, ini xe c d U{x) > —oo, and there exists d\ such that sup^g^ U(x) < oo and for some Si £ (0, 1), 
su PaeC c U(x)/V(x) < —Si, assumption (iWj) is satisfied. 



Proof. We have 



Q (e v ) (as) < exp (V{x) + U(x) + log |||Af||| ) , Vz e X. 



As V is unbounded, for any S £ (0, Si) there exists d large enough such that for all x £ X and d > d, 

I Ci (x)Q(e v ) (x) <exp(V(x)(l-S)), I Cd (x)Q (e v ) (x) < exp (d + sup U(y) + log \\\M\\\ V ) , 

\ yec d / 
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which is enough to verify the drift part of (A2). The minorization condition with mo = 1 and the 
Q{Cd){x) > part are direct as U(x) is bounded below on Cg. □ 

In the extensive literature on Lyapunov drift for Markov kernels there are several conditions which 
immediately guarantee the ex istence of v such that |||M HL < oo. For example, any M satisfying the 
polynomial drift condition of iJarner and Roberts automatically satisfies |||M|||„ < oo for the 



same v up to a factor of e. However, ergodicity of M is not necessary, as illustrated in the following 
simple example. 

4.1.1 Gaussian Random Walk 

Let X := K and U and M be defined by 

U(x):=-x 2 , M(x,dy) := — 1= exp |- ^ ^ j dy, 



The numerical results of Figure 14.11 show estimates of 



with fixed N 



where dy denotes Lebesgue measure. Taking ip as Lebesgue measure, the ^-irreducibility and aperi- 
odicity of {Q n ;n > 1} is immediate. For the drift and minorization conditions of (EG), elementary 
manipulations show that equation (j2.2[) holds with V(x) = x 2 / (2 (1 + So)) + 1 for suitable do > 
and solutions of the minorization condition (|2.1[) are also easily obtained. Condition (Ej3) is trivially 
satisfied because U is non-positive. The density assumption (Ej4) is satisfied with fid proportional to 
the restriction of Lebesgue measure to Cd- Assumption (E(S) holds for e small enough and eo = e/2. 

It is generally not easy to obtain or estimate values for the constants in Theorem 13.21 In all the 
numerical examples which follow, we consider a fixed value of N and consider the relative variance as 
a function of the n and the initial condition x. 

r 'iiAi) ; 

Jn,x{ 1 ) , 

2000, for various x and n, with in each case the expectation approximated by averaging over 2 x 10 
independent simulations of the particle system. For this model 7 niX (l) can be computed analytically, 
and this exact value was used in the estimates. The linear growth of the relative variance and its 
dependence on the initial point x is apparent from the figure. 

4.2 Cases with a multiplicative drift assumption on M 

The following Lemma shows that condition (E(5) holds for suitable U when M itself satisfies a multi- 
plicative drift condition. 

Lemma 4.2. Assume that there exists V : X — > [l,oo) unbounded, 8i > 0, d± > 1 and for each d>d\ 
there exists bd < oo such that 

M(e v ) < e v<i-*0+Mo is (4.1) 

and the set Cd = {x; V(x) < d} is (1, e^, Vd)-small for M , with Vd(Cd) > and M(Cd)(x) > for all 
x. Then if U + G Cv , hm r ^oo 1 1 Hc= 1 1 ^ = and for all finite d, inf^gCd U(x) > —oo, assumption 
holds. 
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Figure 4.1: Top: Gaussian random walk model. Bottom: ergodic autoregression model. Left: Relative 
variance vs. initial condition x , at times □, n = 20; x , n = 40; *, n — 60; o, n = 80; o, n = 100. Right: 
Relative variance vs. n, from initial conditions (dashed) xo = 0, (solid - top) x = 4, (solid - bottom) 
x Q = 10. 
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Proof. Due to the drift condition (|4.1[) . for any S € (0,#i), 



Q(e v ) < exp(V(l-S)-(S 1 -6)V + U+ + b d l Cd ) , 
and due to lim r _ ) . 00 ||lcc[/ + || = 0, there exists d such that for all d > d, 

Q(e V ) <exp(V(l~S) + b d lc d ), 

where bd := bd + d\\U + \\ v , which verifies the drift part of (EJ2J). The minorization condition with 
mo = 1 and Q(Cd){x) > part are direct as U(x) is bounded below on Cd- □ 

4.2.1 Ergodic Autoregression 

Let X := K and U and M be defined by 

U(x):=\x\, M(x, dy) := -L exp (- {v ~ ^ J dy, 



for fixed \a\ < 1. Elementary manipulations then show that, for Sq > and d large enough, M 
satisfies (|4.ip with = x 2 / (2 (1 + Sq)) + 1. As per the random walk example, M readily admits 

minorization on the sublevel sets Cd- 

The potential function U clearly satisfies (E(3j) . Lemma 14.21 shows that (EE]) is satisfied. The 
density assumption (Ej4]) is satisfied for fid proportional to Lebesgue measure restricted to Cd- Again 
it is straightforward to check that (t(5j) is satisfied for e > small enough and eo = e /2- 

Figure 14.11 also shows estimates of the relative variance obtained by simulation for this model with 
a = 0.4 and using N — 10 4 particles, averaged over 10 4 independent realizations. Again the linear 
growth of the variance is apparent, but there appears to be less variation with respect to the initial 
condition than in the random walk example. 

4.2.2 Cox-Ingersoll-Ross Process 



The Cox-Ingersoll-Ross (CIR) process, [Cox et all Il985l |. is a diffusion process that is typically used 



in financial applications to capture mean-reverting behaviour and state-dependent volatility, which is 
thought to occur in many real scenarios. The process is defined via the stochastic differential equation: 

dX t =9(jj,- X t ) dt + ayfx t dWt 



where {Wt} is standard Brownian motion, 9 > is the mean-reversion rate, /i > is the level of 

a 

mean-reversion and a > is the volatility. We assume that — — > 1 so that the process is stationary 

a 2 

and never touches zero. 

Throughout the remainder of section 14.2.21 for A > we denote by M A the transition probability 
from any time t to t + A of the CIR process with parameters 9, /i, a. The following lemma identifies a 
drift function for M A , exhibiting a trade-off between growth rate of the drift function specified by a 
parameter s, the parameters of the CIR process and the time step size A. 



18 



Lemma 4.3. For s > and A > 0, consider the candidate drift function V : M.+ — > [1, oo), defined by 

AOsx 



V{x) := 1 + 



CT 2 (1 _ e -9A) 



Then subject to the conditions: 
l-e- eAs 



se o 



s G 0,1 



-6A 



2 / ' V l-2s, 

t/ie following multiplicative drift condition is satisfied: 

M A (e v ) < e v ^ +b ^, 
with V as in an d bd '■= j- log (1 — 2s) + 1. 



d> l-2^1og(l-2s)/^ 



l-e- eA /(l-2s)-S 



1 - 2s o~ 



Proof. For t > define 



•2(9 



_ 46>^ 



(4.2) 



(4.3) 



and the scaled process Z t := 2c f JT t . Conditional on Xo = a;, Z t has a non-central chi-s quare distr i bution 
with degree of freedom k and non-centrality parameter taking the value 2ctxe~ et 
We then have for any ieX, 



Cox et al 



M (e v ) (x) 



l x [exp (sZ A )} exp(l 
-6A 



= exp 
< exp 



2caxs 



1 - 2s 

-0A 



log(l-2s) + l 



noifr* -|io g (i- 2s)+ i 



where the equalities hold due to the existence of the moment generating function [exp(sZt)], for 
s < 1/2, which is satisfied under the conditions of (|4.3[) . Under these conditions we also then have for 
d > d and x 4. Cd, 



M A (eM(x) < exp 



V(x) (1 - 8) - d 1 



-6A 



1 - 2s 



log (1 - 2s) + 1 



< exp[V(x)(l-S)} 



and for x G Cd, 



M (e v ) (x) < exp 



e -eA 
1 - 2s 



-log(l-2s) + l 



exp (b d ) 



□ 



We will consider as an example the case where the Markov chain {X n } is the skeleton of the CIR 
process over a discrete time grid of spacing A and U(x) := a log a; for some fixed a. Lemmata l4.2l and 
|3] establish that (E(2])-(t(3]) are satisfied and one can check (H1])-(H5]) are satisfied similarly to the 
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Figure 4.2: Cox-Ingersoll-Ross process. Relative variance vs. n, from initial conditions xa — 0.1 
(dashed), xq = 1 (solid - bottom), xq — 3 (dot-dashed), xq = 10 (solid top). 



previous example. 

Figure l4~2l displays estimates of the relative variance for this model, computed via simulation, when 
A = 0.01, (i.e. M = M 01 ), a = 0.01, 6 = 10, fj, = 1, and a = 0.1. This was obtained using N = 10 3 
particles, averaged over 3 x 10 3 independent realizations. Again the linear growth of the relative 
variance is present for different initial conditions. Note one may interpret 7100, x(l) as the geometric 
mean Ez[nfc=o X^ 100 ], which can be used for prediction in a variety of financial applications. 



5 Summary 

In this paper we have established a linear-in-n bound on the non-asymptotic variance associated with 
particle approximations of time-homogeneous Feynman-Kac formulae, under assumptions that can be 
verified on non-compact state-spaces. 

There are several possible extensions to this work. Firstly, to consider non-homogeneous Feynman- 
Kac formulae, which occur routinely in applications such as filtering and Bayesian statistics. Secondly, 
an important developing area in the analysis of se quential Mon t e Car lo methods is the case when 



the dimension of the state-space can be very large [Beskos et all l201l| . Such analysis has relied on 
classical geometric drift conditions and it would be interesting to consider the role of multiplicative 
drift conditions in this context. 
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A Proofs and Auxiliary Results for Section [2] 

Proof. [Proof of Lemma [2~Tj Fix any d > d. The upper bound of (|2.3I) is an immediate consequence of 
the inequality Q (e v ) / e v < e bd , implied by (|2~2"j) . 

For the upper bound of (|2.4j) . use the standard inequality (Q) < \\\Q\\\ V and then also due to the 
drift condition in (|2.2|) . |||Q|||t, < oo. Now consider the lower bound. It is claimed that for any k > 3 
and 1 < j < k — 1, 

Q kmo (e v )(x) > Q (fc _ i)tno (IcJ {xy d v d {df- 1 v d (e v ) , Vx e X, (A.l) 

where mo is as in For each fc, the claim is verified by induction in j; fix k > 3 arbitrarily. For 

3 = 1, 



Qkm (e V ) (x) > Q( k -l)m i^C d Qm {& V )) (x) > Q(k-l)m (IcJ {x)t d V d 



which initializes the induction. Now assume that (|A.1|) holds at rank 1 < j < k — 1. Then at rank 
j + 1, applying the induction hypothesis 

Qfcmo (e^) (x) > Q(k-j-l)m { l C d Q mo (IcJ) {x)e d v d (C d ) J ~ v d (e V ) 
> Q(k-j-i) mo (IcJ (x)e d +1 u d (C d ) j v d (e v ) , V.eX, 



where (|2.1|) has been applied, thus the claim is verified. 
Now applying (jA.ip with j = k — 1 gives, 

Qfcmo (e^) (g) gmoJIgJW fc-l , r ^fc-2 , » y w 

— — - — ^) — e rf ^ d ^ ^ d \ e > > °' Vx G X ' 

which implies that 



iiiQ^oiiiy (fcmo) > 4 _1/(fcmo) ^ (Cd) l_2/(fcmo) (e^) 1/(femo) 



sup Qfe)(x)l 1/(fcroo) 



Taking fc — > oo is enough to verify (|2.4p . as lim r i_ >00 |||Q n |||w always exists by subadditivity. □ 

Proof. [Proof of Lemma [272] Set r > d arbitrarily and let := lc r Q- For n > 1, denote by Q„ the 
n-fold iterate of Q^. 
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Then under (IE), 



5*0+1 



ra=0 



= E a 

< exp (rt \\U + \\ V 
and therefore under (EEJ), 



J| I Cr (X„) exp ((7 (X„)) I A (A to+1 ) 

to 

H He, (X„) exp (C/ (X to )) I A (X to+1 ) 



E, 



, Va;eX,AeB(X), 



5*0 + 



<(3* r (A) :=exp(rt Q \\U+\\ v ) / f3 r (dy)Q(y, A), Vx e X, A e B(X) 



(A.2) 



c. 



Lemma B3 of 



Kontoviannis and Mevnl . 12005^ then implies that Q 2 '* +2 i s w-separable. 
In order to establish that Q2t +2 is v-separable, we will prove that |||Q2* +2 — Qat^+alll" can ^ e 
made arbitrarily small through suitable choice of r. By decomposing the difference Q2t +2 — 62*0+2 m 
a telescoping fashion and applying the sub-additive and sub-multiplicative properties of the operator 
norm we obtain: 

2to + l 

|||Q2t +2 - Qata+allk - E IH^2*o+2-(n+l) ( 3™+ 1 _ Q2t +2~nQn III v, 
n=0 

2to + l 

< III Q - Q [r) III. E IIIQ2*o+2-(n+l) HI- IHOn HI" ' ( A ' 3 ) 
n=0 

Now for any n > 0, sup r |||Q„ \\\ v < |||Qn|||o < 00, where the final inequality follows from equation 
(|2~5j) of Lemma O and by we have |||Q - \\\ v -> as r -+ 00. Therefore it follows from (|A~3]) 
that |||Q2*o+2 — IIIk + ^ r + °°i so we conclude that Q2t +2 is w-separable. This completes 

the proof. □ 

The following lemma considers the twisted kernel P defined in (|2.10p . 

Lemma A.l. Assume (Ijlty-(I^). Then there exists S G (0,5), d > 1 and for any d > d , there 
exists bd < 00 such that 



p 



sup e 



e 



< 00, 



(A.4) 
(A.5) 



where V : X — > [l,oo) is defined by V(x) := V(x) — log/io^) + log ||/io|L. Furthermore, there exists 
p < 1, depending only on do and do, and for any d > do there exists b' d < 00 such that 



P 



(■') 



(A.6) 
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Proof. Under the assumptions of the lemma, we have already seen via Kontoviannis and Mevnl . 
Proposition 2.8] that the twisted kernel is well defined. First consider, (IA.4lk under (EQ, setting 
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So E (0, S), for any d > d, 

< exp (1/ - log ho - S V - {§ ~ 6 )V - log A + b d lc d ) ■ 

As V is unbounded, there exists do such that for all d > do, equation (j A.4|) holds with b d := bd — log A. 
For (|A.5jl by iteration of the eigenfunction equation, we have that for any d > do, 

h {x) = \- mo Q mo {h ) (x) > e d v d (ho) , VxeC d 

where we apply the minorization part of (EE]) to obtain the inequality. 

It remains to establish (|A.6[) . First considering the case x £ Cd, (| A.4|) implies that P ( e ^) ( x ) < 
e v(x)-8 v(x) < e v(x)-s d SQ that ([X6ll holds with p := e - * * 3 . For x € C d , equation ([A~4f shows that 
(|A.6|) with b' d := exp(d- loge d - logz/ d (/i ) + log ||^o||„)- □ 

B Proofs and Auxiliary Results for Section [3] 

In this appendix we detail the proofs and auxiliary results that are used in Section [3] The proofs and 
results are provided in a logical order; that is, each result at most depends on the preceding one(s). 
In particular, the proof of Lemma \3 . 1 1 follows the proof of Lemma TB.U 

Lemma B.l. Assume (I{^)-(IJ^).Then there exists p < 1, e? > 1 and for any d > d there exists 
bd < oo and b' d < oo such that 

pLv'\ < e v*-v+b d ic d 

P(e v ') < pe v '+b' d I Cd , (B.2) 

where V* is as in equation Jff.^p . 

Proof. Under the assumptions of the lemma, Theorem 12.21 holds, the eigenfunction ho £ C v , and the 
twisted kernel is well defined. Then under (BjSJ), we have for any d> d, 

< exp (V (1 + e) - log h a ~V~e V- log A + b* d l Cd ) . 

As V is unbounded, there exists do such that for all d > do, equation (|B.1|) holds with bd := b* d — log A. 
The proof of (|B.2|) then follows exactly as in the proof of Lemma [A. 11 □ 

Proof. [Proof of Lemma l3~T] We first consider some bounds on iterates of the twisted kernel. Standard 
iteration of the geometric drift condition in equation (|B.2[) shows that there exists a finite constant c\ 
such that 

8upP n (v*)(x) <civ*(x), xeX, (B.3) 

n>0 
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and then due to the multiplicative drift condition in equation (jB.lj) . 



supv(x)P n (v*) (x) = supv(x)PP n -i (v*) (x) < c 1 v(x)P(v*) (x) < cv*(x), x € X, (B.4) 

n>0 ri>0 

where c := c\e bd . 

In order to prove ()3.6j) first fix arbitrarily n > 1, 1 < s < n + 1 and (ii, . . . ,i s ) € T n s . The 
proof is via a backward inductive argument through the coalescent time indices. Assume that at rank 
1 < 3 < s, 



v(x)E x 

Assuming (|B.5[) is true, then at rank j — 1 



II v (x fe ) ^ (x n+1 _ i3 .; 



< c s+1 ^'t;*(x) 



(B.5) 



v(x)E x 



k&^ij—4j — -L 1 ...,i s —ij—i} 



v(x) / P ij - ij _ 1 (x,dx )v(x )E X > 



n «(A fc )«* (x„+i_ l3 ) 

+ i — ?j ,...,2 S — ij } 

<c s+1 -^(x) / (x,dx')v*(x') 

where the final inequality is due to equation (jB.4[) . Furthermore 

v(x)E x [v* (Xn+i-%,)] = v(x)P n+ i- is (v*) (x) < cv*(x), 
where the inequality is again due to (|B.4|) and therefore at rank j = s — 1, 



v(x)E x 



H v (X k ) v* (A > „ +1 _ 

k—(i s — z s _ i ) 



= u(z) / Pi s -i s _ 1 {x,dx')v(x')E x , [v* (l n _, s )] 



< cu(x) / Pi s -i s _ 1 {x,dx)v*(x) 

< C V (X ). 

The above arguments prove that (|B.5[) holds at rank j = I and the proof of the Lemma is then also 
complete as n+ 1, 1 < s < n + 1 and (ii, . . . € 2,i, s were arbitrary. □ 

Lemma B.2. issume (7i{7|]-(7i{5p. Then there exists c < oo depending only on the quantities in 
such that for any n > 1 and : X — >• Kq", 



A 2 "DQ,r^ r) (.,:.,■') ; rr(, ■)/,„( .,-)/>„ ( £ ) (.,-). (.,-..,-') t X. 



(B.6) 
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where v is as in and A and ho € C v are respectively the eigenvalue and eigenfunction as in 

Theorem \2.2[ 

Proof. By standard iteration of the geometric drift condition in equation (|A.6[) of Lemma IA.11 there 
is a finite constant c such that 



supP„ (v) (x) < cv(x), ieX. (B.7) 



n>0 



Then due to the definition of the twisted kernel and v (see Lemma [A. ip . there exists a constant c such 
that for any n > 1, and <p : X — > M.^ , 

\~ 2n Qf (<p ® v) (x, x 1 ) = h (x)h (x')P® 2 (jj- ® (x, x') 

< ch (x)h (x')P® 2 ® uj 

< ch (x)P n (j^J(x)v(x'), (x,x')eX 2 , (B.8) 

where the final inequality is due to (|B.7[) . □ 

Lemma B.3. Assume Then there exists c < oo depending only on the quantities in 

f-fflp-f-^EP such that for any m > 1, n > and (x,x') € X 2 , 

X -Hm+n) D Q®2 D Qf ^1/2 g ^1/2^ ^ ^ < ci^V*)^ [«* (j£ ro )] • 

Proof. Throughout the proof c is a finite constant whose value may change on each appearance. 
When n = 0, 

X-*(<n4*)DQ®*DQ* 2 (y 1 '*<av L ' 2 ) {x,x') = \- 2( - m +^Qf 2 (d {v 1 ' 2 ® v 1 ' 2 )) (x,x) 

= \- 2 ( n + m )Q® 2 (v®l)(x,x) 

< cv(x)h (x)P m (j^-) ( x ) 

< cv(x)h (x)P m (v*) (x) 

= cv(x)h (x)E x [v* (X m )] , 

where the first inequality is due to Lemma lB.21 and the second inequality is due to the definition of v* . 
Now consider the case n > 1. We have 

\- 2n DQf 2 (v 1/2 ® v 1 ' 2 ) (x, x 1 ) < cv(x)h (x)P n (jj^j (*) 

< cv(x)h (x)P n («*) (x) 

< cv(x)h {x)E x [v* (X n )] v{x'), 



where we have used v > 1, Lemma IB. 21 with ip = v, the definition of w* and again v > 1. A further 
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application of Lemma IB. 21 with ipjx) — v(x)ho(x)lt x [v* (X n )~\ and an application of Lemma [3TT] yields: 



\-^ m +^DQ® 2 DQ® 2 (V/2 g, w i/2) ( a . i a,) < ^(^(a;)^ [% m [« (X ) «* (!„)]] 

< c 2 U (x)/ l0 (x)E x [»* (X m )} . 



This completes the proof. 



□ 



Proof. [Proof of Proposition 13. 1| The starting point of the proof is to write, using the definition of the 
twisted kernel, 

\-2ttTi('li".,«»] /p\ \-2ri-n( i i.---. i s) / F \ 

p(ii,...ia) (p) — A Ln - X [fj _ A 1 n,x (r ) 

n ' X ; ' A-2» 7 „^(l) 2 ^(^[l/fto^)] 2 ' 
Thus in order prove (|3.5[) . we need to prove 



\- 2n h^(x)T^-^ (F) < \\F\\ vl/2 2 c s+l ^t z 



l[ v (X k ) v* 

fee{ii,...,i s -i} 



(B.9) 



for each n > 1, < s < n+1 and each possible configuration of the coalescent time indices (ii, i s ) £ 
X ll:S . We will consider first the case s > 1 and then s < 1. Throughout the remainder of the proof, c 
denotes a finite and positive constant, whose value may change on each appearance but depends only 
on the constants in (L(T])-(Lj5]). 

Consider the case s > 1. It is claimed that there exists a finite constant c such that for any n > 1, 
(x, x') G X 2 , F £ £^1/2 2 , 1 < s < n + 1, and any (ii, . . . , i s ) S 2n jS , 



'_^ +l r(.,-)/, l ,(.r)E, [] 

fe£{i2— ii,...,is-i— h} 



(B.IO) 



with the convention that the product is equal to unity when s = 2. For a given n, the claim is proved 
by backward induction through the coalescent time indices. The inductive hypothesis is that at rank 
1 < j < s - 1, 

x -2 { n-i j)DQ m^ ..DQf 2 _ is _DQ®l is (V/2 8 u i/a) 

< c^+^/ioO^ ]J v (X k ) v* (X ia ^) , (B.ll) 

fcE{ij-(-i— ij,...,i s — i— ij} 

with the convention that the product equals unity when j + 1 = s. 

To initialise the induction, we have at rank j = s — 1 that the left hand side of (|B.11I) is 

and Lemma [B.3I then shows immediately that (|B.11|) does indeed hold at rank s— 1. We point out that 
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the constraint F g C v i/2 2 in the statement of the proposition is imposed because in the case i s = n 
we immediately encounter DQ® 2 _ iB {v 1 / 2 ® v 1 ^ 2 ) — D(v x l 2 ® v 1 / 2 ) — v, and we can control integrals 
involving v using the drift conditions, as in Lemma IB. 31 If we were to give a separate treatment of 
r[''i'"' ls ' (F) for coalescent time configurations in which i s ^ n, the constraint on F could be relaxed 
to a larger function class. 

Proceeding with the induction, when the hypothesis (jB.llj) holds at rank j, we have at rank j — 1: 



X-^-^DQ^ . . . DQf 2 _ lB _DQf 2 u (« 1/2 ® v^ 2 ) (x, x>) 



k£{ij — ij_i,...,i s _i— i$—x} 



n 



t>(l fc )i;* (l'i s _i 3 ._ 1 



where the inequality follows from applying the induction hypothesis, then multiplying by v(x') > 1 
and then applying Lemma IB . 2 1 with (p(x) the ir-dependent part of the right hand side of (|B.11|) . This 
concludes the inductive proof of (jB.lOj) . 

Consider the case s > = 0. Multiplying the right hand side of (|B.10[) by v(Xo) — v(x) > 1 
and recalling the definition of Tn*x "' ls ^ and 7^ = S x , we immediately obtain (|B.9[) ■ as desired. In the 
case i\ > 0, we multiply (|B.10j) by v(x') and apply Lemma [B. 2 1 in a similar fashion as before to yield 

X- 2 -DQf 2 DQf 2 _ n ...DQl\_DQf 2 la M 
< c s+2 v(x)h (x)E x 11 v(X k )v*(X ts ) 

fe£{il,...,i»_l} 

so again we obtain (|B.9j) as desired. This completes the treatment of the case s > 1. 
For the case s = 1, i\ > 0, 

\- 2n Qf 2 DQt 2 ti (V/ 2 ® v 1 ' 2 ) (x, x) = \- 2n DQl 2 DQ® 2 n (v 1 ' 2 ® v 1 ' 2 ) (x, x') 

< cv{x)h (x)E x [v* (X n )] , 

where the inequality is due to an application of Lemma IB.3I Thus we have (|B.9|) in the case s = 
1, i\ > 0. It only remains to address the case s = 0, because for the case s = 1, %\ = we observe that 
T { n} x (F) = T'hl (F). 

For s = we have T^} x (F) = 7® % (F) = Q® 2 (F) (x, x) < \\F\\ vl/2 ^ Q® 2 {v ® v)(x, x) and therefore 
(recall v from lemma [A. 1[) 

\- 2n h 2 (x)Tl% (F) < \\F\\ vl/2<2 \- 2n ho 2 (x)Q®\v®v)(x,x) 
< c\\F\\ vl/2i2 P® 2 (v®v)(x,x) 
v(x) 



< cWFll, 



»i/2,2 



h (x) 



v*(x). 



(B.12) 
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where the final inequality follows by iteration of the geometric drift condition (|A.6|I and the definition 
of v*. Thus (|B.9j) holds in the case s — 0. This completes the proof of the proposition. □ 
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